hong kong university
Multi Layered Autonomy and AI Ecologies in Robotic Art Installations
Chen, Baoyang, Xu, Xian, Qu, Huamin
This paper presents Symbiosis of Agents, is a large-scale installation by Baoyang Chen (baoyangchen.com), that embeds AI-driven robots in an immersive, mirror-lined arena, probing the tension between machine agency and artistic authorship. Drawing on early cybernetics, rule-based conceptual art, and seminal robotic works, it orchestrates fluid exchanges among robotic arms, quadruped machines, their environment, and the public. A three tier faith system pilots the ecology: micro-level adaptive tactics, meso-level narrative drives, and a macro-level prime directive. This hierarchy lets behaviors evolve organically in response to environmental cues and even a viewer's breath, turning spectators into co-authors of the unfolding drama. Framed by a speculative terraforming scenario that recalls the historical exploitation of marginalized labor, the piece asks who bears responsibility in AI-mediated futures. Choreographed motion, AI-generated scripts, reactive lighting, and drifting fog cast the robots as collaborators rather than tools, forging a living, emergent artwork. Exhibited internationally, Symbiosis of Agents shows how cybernetic feedback, robotic experimentation, and conceptual rule-making can converge to redefine agency, authorship, and ethics in contemporary art.
MonoGlass3D: Monocular 3D Glass Detection with Plane Regression and Adaptive Feature Fusion
Zhang, Kai, Zhao, Guoyang, Shi, Jianxing, Liu, Bonan, Qi, Weiqing, Ma, Jun
Detecting and localizing glass in 3D environments poses significant challenges for visual perception systems, as the optical properties of glass often hinder conventional sensors from accurately distinguishing glass surfaces. The lack of real-world datasets focused on glass objects further impedes progress in this field. To address this issue, we introduce a new dataset featuring a wide range of glass configurations with precise 3D annotations, collected from distinct real-world scenarios. On the basis of this dataset, we propose MonoGlass3D, a novel approach tailored for monocular 3D glass detection across diverse environments. To overcome the challenges posed by the ambiguous appearance and context diversity of glass, we propose an adaptive feature fusion module that empowers the network to effectively capture contextual information in varying conditions. Additionally, to exploit the distinct planar geometry of glass surfaces, we present a plane regression pipeline, which enables seamless integration of geometric properties within our framework. Extensive experiments demonstrate that our method outperforms state-of-the-art approaches in both glass segmentation and monocular glass depth estimation. Our results highlight the advantages of combining geometric and contextual cues for transparent surface understanding.
- Asia > China (0.72)
- North America > United States > California (0.68)
- Research Report > Promising Solution (0.68)
- Overview > Innovation (0.54)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Robots (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
ScIRGen: Synthesize Realistic and Large-Scale RAG Dataset for Scientific Research
Lin, Junyong, Dai, Lu, Han, Ruiqian, Sui, Yijie, Wang, Ruilin, Sun, Xingliang, Wu, Qinglin, Feng, Min, Liu, Hao, Xiong, Hui
Scientific researchers need intensive information about datasets to effectively evaluate and develop theories and methodologies. The information needs regarding datasets are implicitly embedded in particular research tasks, rather than explicitly expressed in search queries. However, existing scientific retrieval and question-answering (QA) datasets typically address straightforward questions, which do not align with the distribution of real-world research inquiries. To bridge this gap, we developed ScIRGen, a dataset generation framework for scientific QA \& retrieval that more accurately reflects the information needs of professional science researchers, and uses it to create a large-scale scientific retrieval-augmented generation (RAG) dataset with realistic queries, datasets and papers. Technically, we designed a dataset-oriented information extraction method that leverages academic papers to augment the dataset representation. We then proposed a question generation framework by employing cognitive taxonomy to ensure the quality of synthesized questions. We also design a method to automatically filter synthetic answers based on the perplexity shift of LLMs, which is highly aligned with human judgment of answers' validity. Collectively, these methodologies culminated in the creation of the 61k QA dataset, ScIRGen-Geo. We benchmarked representative methods on the ScIRGen-Geo dataset for their question-answering and retrieval capabilities, finding out that current methods still suffer from reasoning from complex questions. This work advances the development of more sophisticated tools to support the intricate information needs of the scientific community.
- Asia > China > Hong Kong (0.41)
- North America > Canada > Ontario > Toronto (0.06)
- Asia > China > Guangdong Province > Guangzhou (0.05)
- (3 more...)
- Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)
A Physics-informed End-to-End Occupancy Framework for Motion Planning of Autonomous Vehicles
Shen, Shuqi, Yang, Junjie, Lu, Hongliang, Zhong, Hui, Zhang, Qiming, Zheng, Xinhu
Accurate and interpretable motion planning is essential for autonomous vehicles (AVs) navigating complex and uncertain environments. While recent end-to-end occupancy prediction methods have improved environmental understanding, they typically lack explicit physical constraints, limiting safety and generalization. In this paper, we propose a unified end-to-end framework that integrates verifiable physical rules into the occupancy learning process. Specifically, we embed artificial potential fields (APF) as physics-informed guidance during network training to ensure that predicted occupancy maps are both data-efficient and physically plausible. Our architecture combines convolutional and recurrent neural networks to capture spatial and temporal dependencies while preserving model flexibility. Experimental results demonstrate that our method improves task completion rate, safety margins, and planning efficiency across diverse driving scenarios, confirming its potential for reliable deployment in real-world AV systems.
LLM-ODDR: A Large Language Model Framework for Joint Order Dispatching and Driver Repositioning
Lyu, Tengfei, Feng, Siyuan, Liu, Hao, Yang, Hai
--Ride-hailing platforms face significant challenges in optimizing order dispatching and driver repositioning operations in dynamic urban environments. Traditional approaches based on combinatorial optimization, rule-based heuristics, and reinforcement learning often overlook driver income fairness, interpretability, and adaptability to real-world dynamics. T o address these gaps, we propose LLM-ODDR, a novel framework leveraging Large Language Models (LLMs) for joint Order Dispatching and Driver Repositioning (ODDR) in ride-hailing services. LLM-ODDR framework comprises three key components: (1) Multi-objective-guided Order V alue Refinement, which evaluates orders by considering multiple objectives to determine their overall value; (2) Fairness-aware Order Dispatching, which balances platform revenue with driver income fairness; and (3) Spatiotemporal Demand-A ware Driver Repositioning, which optimizes idle vehicle placement based on historical patterns and projected supply. We also develop JointDR-GPT, a fine-tuned model optimized for ODDR tasks with domain knowledge. Extensive experiments on real-world datasets from Manhattan taxi operations demonstrate that our framework significantly outperforms traditional methods in terms of effectiveness, adaptability to anomalous conditions, and decision interpretability. T o our knowledge, this is the first exploration of LLMs as decision-making agents in ride-hailing ODDR tasks, establishing foundational insights for integrating advanced language models within intelligent transportation systems. Ride-hailing platforms [1], [2] have revolutionized urban transportation by efficiently connecting passengers with vehicles through digital marketplaces. These platforms face complex real-time decision-making challenges, particularly in order dispatching (matching riders to drivers) and driver repositioning (strategically relocating idle vehicles) [3]. Tengfei Lyu is with the Thrust of Artificial Intelligence, The Hong Kong University of Science and Technology (Guangzhou), Guangzhou, China, and The Hong Kong University of Science and Technology, Hong Kong, SAR, China (e-mail: tlyu077@connect.hkust-gz.edu.cn). Siyuan Feng is with the Department of Aeronautical and Aviation Engineering, The Hong Kong Polytechnic University, Hong Kong, SAR, China (e-mail: siyuan.feng@polyu.edu.hk). Hao Liu is with the Thrust of Artificial Intelligence, The Hong Kong University of Science and Technology (Guangzhou), Guangzhou, China, and also with the Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Hong Kong, SAR, China (e-mail: liuh@ust.hk).
- Asia > China > Hong Kong (1.00)
- Asia > China > Guangdong Province > Guangzhou (0.85)
- North America > United States > California (0.04)
- Asia > China > Hunan Province > Changsha (0.04)
- Transportation > Passenger (1.00)
- Transportation > Ground > Road (1.00)
From Prohibition to Adoption: How Hong Kong Universities Are Navigating ChatGPT in Academic Workflows
Huang, Junjun, Wu, Jifan, Wang, Qing, Yuan, Kemeng, Li, Jiefeng, Lu, Di
This paper aims at comparing the time when Hong Kong universities used to ban ChatGPT to the current periods where it has become integrated in the academic processes. Bolted by concerns of integrity and ethical issues in technologies, institutions have adapted by moving towards the center adopting AI literacy and responsibility policies. This study examines new paradigms which have been developed to help implement these positives while preventing negative effects on academia. Keywords: ChatGPT, Academic Integrity, AI Literacy, Ethical AI Use, Generative AI in Education, University Policy, AI Integration in Academia, Higher Education and Technology
- South America > Brazil > Pernambuco > Recife (0.04)
- North America > United States > New York (0.04)
- Europe > Switzerland (0.04)
- (2 more...)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.96)
AI-rays: Exploring Bias in the Gaze of AI Through a Multimodal Interactive Installation
Gao, Ziyao, Zhang, Yiwen, Li, Ling, Papatheodorou, Theodoros, Zeng, Wei
Numerous cases have demonstrated that specific appearance signals can implicitly correlate with biased social sorting, causing Data surveillance has become more covert and pervasive with AI injustice. For example, AI predictive policing overestimates recidivism algorithms, which can result in biased social classifications. Appearance risk for black people [David Robinson 2016], recruitment offers intuitive identity signals, but what does it mean engines prefer male candidates for tech jobs [Reuters 2018], and to let AI observe and speculate on them? We introduce AI-rays, AI beauty contests favor white winners [Levin 2016]. Nowadays, an interactive installation where AI generates speculative identities machine scrutiny is pervasive and constant. How do machines interpret from participants' appearance which are expressed through our appearance cues? Who is putting that speculation to use? synthesized personal items placed in participants' bags. It uses Does the meaning of appearance signal change when machines, speculative X-ray visions to contrast reality with AI-generated assumptions, not humans, observe us? metaphorically highlighting AI's scrutiny and biases. AI-rays promotes discussions on modern surveillance and the future of human-machine reality through a playful, immersive experience exploring AI biases.
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.17)
- Asia > China > Guangdong Province > Guangzhou (0.07)
- Asia > China > Hong Kong (0.06)
- (2 more...)
SkillMimic: Learning Reusable Basketball Skills from Demonstrations
Wang, Yinhuai, Zhao, Qihan, Yu, Runyi, Zeng, Ailing, Lin, Jing, Luo, Zhengyi, Tsui, Hok Wai, Yu, Jiwen, Li, Xiu, Chen, Qifeng, Zhang, Jian, Zhang, Lei, Tan, Ping
Mastering basketball skills such as diverse layups and dribbling involves complex interactions with the ball and requires real-time adjustments. Traditional reinforcement learning methods for interaction skills rely on labor-intensive, manually designed rewards that do not generalize well across different skills. Inspired by how humans learn from demonstrations, we propose SkillMimic, a data-driven approach that mimics both human and ball motions to learn a wide variety of basketball skills. SkillMimic employs a unified configuration to learn diverse skills from human-ball motion datasets, with skill diversity and generalization improving as the dataset grows. This approach allows training a single policy to learn multiple skills, enabling smooth skill switching even if these switches are not present in the reference dataset. The skills acquired by SkillMimic can be easily reused by a high-level controller to accomplish complex basketball tasks. To evaluate our approach, we introduce two basketball datasets: one estimated through monocular RGB videos and the other using advanced motion capture equipment, collectively containing about 35 minutes of diverse basketball skills. Experiments show that our method can effectively learn various basketball skills included in the dataset with a unified configuration, including various styles of dribbling, layups, and shooting. Furthermore, by training a high-level controller to reuse the acquired skills, we can achieve complex basketball tasks such as layup scoring, which involves dribbling toward the basket, timing the dribble and layup to score, retrieving the rebound, and repeating the process. The project page and video demonstrations are available at https://ingrid789.github.io/SkillMimic/
- Asia > China > Hong Kong (0.05)
- Asia > China > Guangdong Province > Shenzhen (0.05)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- (2 more...)
EditFollower: Tunable Car Following Models for Customizable Adaptive Cruise Control Systems
Chen, Xianda, Han, Xu, Zhu, Meixin, Chu, Xiaowen, Tiu, PakHin, Zheng, Xinhu, Wang, Yinhai
In the realm of driving technologies, fully autonomous vehicles have not been widely adopted yet, making advanced driver assistance systems (ADAS) crucial for enhancing driving experiences. Adaptive Cruise Control (ACC) emerges as a pivotal component of ADAS. However, current ACC systems often employ fixed settings, failing to intuitively capture drivers' social preferences and leading to potential function disengagement. To overcome these limitations, we propose the Editable Behavior Generation (EBG) model, a data-driven car-following model that allows for adjusting driving discourtesy levels. The framework integrates diverse courtesy calculation methods into long short-term memory (LSTM) and Transformer architectures, offering a comprehensive approach to capture nuanced driving dynamics. By integrating various discourtesy values during the training process, our model generates realistic agent trajectories with different levels of courtesy in car-following behavior. Experimental results on the HighD and Waymo datasets showcase a reduction in Mean Squared Error (MSE) of spacing and MSE of speed compared to baselines, establishing style controllability. To the best of our knowledge, this work represents the first data-driven car-following model capable of dynamically adjusting discourtesy levels. Our model provides valuable insights for the development of ACC systems that take into account drivers' social preferences.
- North America > United States > Washington > King County > Seattle (0.14)
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- (7 more...)
- Transportation > Passenger (1.00)
- Transportation > Ground > Road (1.00)
- Automobiles & Trucks (1.00)
- Consumer Products & Services > Travel (0.86)
Enhancing Campus Mobility: Achievements and Challenges of Autonomous Shuttle "Snow Lion''
Chen, Yingbing, Cheng, Jie, Wang, Sheng, Liu, Hongji, Mei, Xiaodong, Yan, Xiaoyang, Tang, Mingkai, Sun, Ge, Wen, Ya, Cai, Junwei, Xie, Xupeng, Gan, Lu, Chao, Mandan, Xin, Ren, Liu, Ming, Jiao, Jianhao, Liu, Kangcheng, Wang, Lujia
Enhancing Campus Mobility: Achievements and Challenges of Autonomous Shuttle "Snow Lion" In recent years, the rapid evolution of autonomous vehicles (AVs) has reshaped global transportation systems. Leveraging the accomplishments of our earlier endeavor, particularly "Hercules" [1], an autonomous logistics vehicle for transporting goods, we introduce "Snow Lion", an autonomous shuttle vehicle meticulously designed to transform on-campus transportation, providing a safe and efficient mobility solution for students, faculty, and visitors. The main aim of this research is to improve campus mobility through a dependable, efficient, and eco-friendly autonomous transportation solution tailored to meet the diverse requirements of a university setting. This initiative significantly differs from the experiences of "Hercules" [1], as the campus environment presents a notable contrast to the structured environments of highways and urban streets. Emphasizing both security and passenger comfort, the primary focus is Figure 1: This figure illustrates the operational scenario of our on passenger transportation. Achieving this goal involves a autonomous shuttle during its service period at The Hong detailed examination of complex system designs that integrate Kong University of Science and Technology (Guangzhou) trajectory planning adjustments, prioritizing pedestrian safety (referred to as HKUST (GZ)).
- Transportation > Ground > Road (1.00)
- Transportation > Passenger (0.91)